The 2nd International Workshop on Inductive Reasoning and Machine Learning for the Semantic Web Proceedings

Authors

Claudia d’Amato

Nicola Fanizzi

Marko Grobelnik

Agnieszka Lawrynowicz

Vojtěch Svátek

Melanie Hilario

Abstract

I will describe a novel meta-learning approach to optimizing the knowledge discovery or data mining (DM) process. This approach has three features that distinguish it from its predecessors. First, previous meta-learning research has focused exclusively on improving the learning phase of the DM process. More specifically, the goal of meta-learning has typically been to select the most appropriate algorithm and/or parameter settings for a given learning task. We adopt a more process-oriented approach whereby meta-learning is applied to design choices at different stages of the complete data mining process or workflow (hence the term meta-mining). Second, meta-learning for algorithm or model selection has consisted mainly in mapping dataset properties to the observed performance of algorithms viewed as black boxes. While several generations of researchers have worked intensively on characterizing datasets, little has been done to understand the internal mechanisms of the algorithms used. At best, a few have considered perceptible features of algorithms like their ease of implementation or their robustness to noise, or the interpretability of the models they produce. In contrast, our meta-learning approach complements dataset descriptions with an in-depth analysis and characterization of algorithms: their underlying assumptions, optimization goals and strategies, together with the structure and complexity of the models and patterns they generate. Third, previous meta-learning approaches have been strictly (meta) data-driven. To make sense of the intricate relationships between tasks, data and algorithms at different stages of the data mining process, our meta-miner relies on extensive background knowledge concerning knowledge discovery itself. For this reason we have developed a data mining ontology, which defines the essential concepts and relations needed to represent and analyse data mining objects and processes. In addition, a DM knowledge base gathers assertions concerning data preprocessing and machine learning algorithms as well as their implementations in several open-source software packages. The DM ontology and knowledge base are domain-independent; they can be exploited in any application area to build databases describing domain-specific data analysis tasks, datasets and experiments. Aside from their direct utility in their respective target domains, such databases are the indispensable source of training and evaluation data for the meta-miner. These three features together lay the groundwork for semantic meta-mining, the process of mining DM meta-data on the basis of data mining expertise distilled in an ontology and knowledge base.
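For readers unfamiliar with the baseline the abstract argues against, the following is a minimal sketch of black-box meta-learning for algorithm selection: dataset properties are mapped to whichever algorithm performed best on the most similar past dataset. It is not the semantic meta-mining approach described in the abstract. The meta-features, the algorithm names, the numbers and the nearest-neighbour rule are all illustrative assumptions.

```python
import math

# Hypothetical meta-dataset: each entry pairs a few dataset properties
# (meta-features) with the algorithm that performed best on that dataset.
# All values are invented for illustration only.
META_DATA = [
    ({"n_instances": 150, "n_features": 4, "class_entropy": 1.58}, "naive_bayes"),
    ({"n_instances": 20000, "n_features": 300, "class_entropy": 0.95}, "linear_svm"),
    ({"n_instances": 5000, "n_features": 20, "class_entropy": 1.00}, "decision_tree"),
]


def to_vector(props):
    """Turn dataset properties into a numeric vector.

    Sizes are log-scaled so that n_instances does not dominate the distance.
    """
    return (
        math.log10(props["n_instances"]),
        math.log10(props["n_features"]),
        props["class_entropy"],
    )


def recommend(props, meta_data=META_DATA):
    """Return the best algorithm of the nearest past dataset (1-NN).

    The algorithms themselves are treated as opaque labels: nothing about
    their assumptions or internal mechanisms enters the decision.
    """
    query = to_vector(props)
    nearest = min(meta_data, key=lambda entry: math.dist(query, to_vector(entry[0])))
    return nearest[1]


if __name__ == "__main__":
    new_dataset = {"n_instances": 200, "n_features": 5, "class_entropy": 1.5}
    print(recommend(new_dataset))
```

The sketch makes the limitation named in the abstract concrete: the recommender sees only dataset descriptors and algorithm labels, so it cannot generalize to an algorithm it has never observed, nor reason about other stages of the workflow.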

Similar resources

Inductive Reasoning and Machine Learning for the Semantic Web

Claudia d’Amato (a), Nicola Fanizzi (a), Marko Grobelnik (b), Agnieszka Lawrynowicz (c) and Vojtěch Svátek (d). (a) Department of Computer Science, University of Bari, Italy. E-mail: {claudia.damato | nicola.fanizzi}@uniba.it. (b) Jozef Stefan Institute, Ljubljana, Slovenia. E-mail: [email protected]. (c) Poznan University of Technology, Poland. E-mail: [email protected]. (d) University of Economic...


Inductive learning for the Semantic Web: What does it buy?

Nowadays, building ontologies is a time-consuming task, since they are mainly built manually. This hinders the full realization of the Semantic Web vision. To overcome this issue, machine learning techniques, and specifically inductive learning methods, could be fruitfully exploited for learning models from existing Web data. In this paper we survey methods for (semi-)automatically bui...


Mining the Semantic Web: Statistical Learning for Next Generation Knowledge Bases

In the Semantic Web vision of the World Wide Web, content will not only be accessible to humans but will also be available in machine interpretable form as ontological knowledge bases. Ontological knowledge bases enable formal querying and reasoning and, consequently, a main research focus has been the investigation of how deductive reasoning can be utilized in ontological representations to en...


Building Rules on top of Ontologies? Inductive Logic Programming can help!

Acquiring and maintaining Semantic Web rules is very demanding and can be automated, though only partially, by applying Machine Learning algorithms. In this paper we show that the form of Machine Learning known under the name of Inductive Logic Programming (ILP) can help. In particular, we take a critical look at two ILP proposals based on knowledge representation frameworks that integrate Description...


DL-Learner - A framework for inductive learning on the Semantic Web

In this system paper, we describe the DL-Learner framework, which supports supervised machine learning using OWL and RDF for background knowledge representation. It can be beneficial in various data and schema analysis tasks with applications in different standard machine learning scenarios, e.g. in the life sciences, as well as Semantic Web specific applications such as ontology learning and e...


Publication date: 2010
